Biological sequence analysis using regular expressions.
نویسنده
چکیده
“Regular expressions”, also known as “RegExps” or sometimes as “grep patterns”, are a way of specifying patterns of characters within a text file or longer string of characters. Regular expressions can contain various “wild cards” or sets of alternatives, which can be used to include ambiguity in the pattern to be matched. One common use is to perform sophisticated find-and-replace operations.
منابع مشابه
Regmex, Motif analysis in ranked lists of sequences
Motif analysis has long been an important method to characterize biological functionality and the current growth of sequencing-based genomics experiments further extends its potential. These diverse experiments often generate sequence lists ranked by some functional property. There is therefore a growing need for motif analysis methods that can exploit this coupled data structure and be tailore...
متن کاملFine Classification & Recognition of Hand Written Devnagari Characters with Regular Expressions & Minimum Edit Distance Method
Regular expressions are extremely useful, because they allow us to work with text in terms of patterns. They are considered the most sophisticated means of performing operations such as string searching, manipulation, validation, and formatting in all applications that deal with text data. Character recognition problem scenarios in sequence analysis that are ideally suited for the application o...
متن کاملCategorizing Event Sequences Using Regular Expressions
IASSIST Quarterly Introduction Researchers who work with large sequential datasets are often limited in the kinds of analytic strategies they can use because of the sheer size of the data. Automated techniques for analyzing sequences were developed in the 1960s by scientists studying DNA, RNA, and proteins. In a classic volume on sequence analysis, Sankoff and Kruskal (1983) demonstrated its po...
متن کاملCategorizing Event Sequences Using Regular Expressions
Introduction Researchers who work with large sequential datasets are often limited in the kinds of analytic strategies they can use because of the sheer size of the data. Automated techniques for analyzing sequences were developed in the 1960s by scientists studying DNA, RNA, and proteins. In a classic volume on sequence analysis, Sankoff and Kruskal (1983) demonstrated its potential applicatio...
متن کاملGenRGenS: software for generating random genomic sequences and structures
SUMMARY GenRGenS is a software tool dedicated to randomly generating genomic sequences and structures. It handles several classes of models useful for sequence analysis, such as Markov chains, hidden Markov models, weighted context-free grammars, regular expressions and PROSITE expressions. GenRGenS is the only program that can handle weighted context-free grammars, thus allowing the user to mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- BioTechniques
دوره 27 1 شماره
صفحات -
تاریخ انتشار 1999